Context Sensitive Verb Similarity Dataset for Legal Information Extraction
نویسندگان
چکیده
Existing literature demonstrates that verbs are pivotal in legal information extraction tasks due to their semantic and argumentative properties. However, granting computers the ability interpret meaning of a verb its properties relation given context can be considered as challenging task, mainly polysemic domain specific behaviours verbs. Therefore, developing mechanisms identify behaviors evaluate how artificial models detect with significant importance. In this regard, comprehensive dataset used an evaluation resource, well training data set, major requirement. paper, we introduce LeCoVe, which is similarity intended towards facilitating process identifying similar meanings context. Using dataset, evaluated both generic embedding models, were developed using state-of-the-art word representation language modelling techniques. As part experiments carried out announced Sense2Vec BERT trained corpus opinion texts order capture behaviours. addition demonstrate neural network model, was by combining semantic, syntactic, contextual features obtained from outputs perform comparatively well, even low resource scenario.
منابع مشابه
Crowdsourcing a Large Dataset of Domain-Specific Context-Sensitive Semantic Verb Relations
We present a new large dataset of 12403 context-sensitive verb relations manually annotated via crowdsourcing. These relations capture fine-grained semantic information between verb-centric propositions, such as temporal or entailment relations. We propose a novel semantic verb relation scheme and design a multi-step annotation approach for scaling-up the annotations using crowdsourcing. We emp...
متن کاملBootstrapping a Verb Lexicon for Biomedical Information Extraction
The extraction of information from texts requires resources that contain both syntactic and semantic properties of lexical units. As the use of language in specialized domains, such as biology, can be very different to the general domain, there is a need for domain-specific resources to ensure that the information extracted is as accurate as possible. We are building a large-scale lexical resou...
متن کاملTemporal information extraction from legal documents
The aim of this paper is to analyze what kinds of temporal information can be found in different types of legal documents. In particular, it provides a comparison of different legal document types (case law, statute or transactional document) and how one can do further reasoning with the extracted temporal information.
متن کاملConcept and Context in Legal Information Retrieval
There exist two broad approaches to information retrieval (IR) in the legal domain: those based on manual knowledge engineering (KE) and those based on natural language processing (NLP). The KE approach is grounded in artificial intelligence (AI) and case-based reasoning (CBR), whilst the NLP approach is associated with open domain statistical retrieval. We provide some original arguments regar...
متن کاملTowards context sensitive information inference
Humans can make hasty, but generally robust judgements about what a text fragment is, or is not, about. Such judgements are termed information inference. By drawing on theories from non-classical logic and applied cognition, an information inference mechanism is proposed which makes inferences via computations of information flow through a high dimensional conceptual space. Within a conceptual ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Data
سال: 2022
ISSN: ['2306-5729']
DOI: https://doi.org/10.3390/data7070087